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PATENT LING.001.01US 
<160>NUMBER OF SEQUENCES : 6 
<170> Patentln Version 3.0 



<210>INFORMATION FOR SEQ ID NO:l 

5 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 { D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: DNA coding sequence for HIV capsid protein Pr55 

15 



U <400> SEQUENCE DESCRIPTION: SEQ ID NO:l 



a j=s 

f ^ 


ATGGGTGCGA 


GAGCGTCGGT 


ATTAAGCGGG 


G G AG AA.T TAG 


ATAAATGGGA 


AAAAATTCGG 


60 




TTAAGGCCAG 


GGGGAAAGAA 


AAAATATAAG 


T T AAAAC AT A 


TAG TAT G GG C 


AAGCAGGGAG 


120 


□20 


CTAGAACGAT 


TCGCAGTCAA 


TCCTGGCCTG 


T T AG AAACAT 


CAGAAGGCTG 


CAGACAAATA 


180 




I I GGGACAGC 


T AC AG C CAT C 


CCTTCAGACA 


G GAT CAGAAG 


AACTTAGAT C 


ATTATATAAT 


240 


if ~: 
=r 


ACAGTAGCAA 


CCCTCTATTG 


T GT AC AT C AA 


AGGATAGATG 


TAAAAGACAC 


CAAGGAAGCT 


300 


H 

r — ■ 
f =t 

\\ i 


TTAGAGAAGA 


TAGAGGAAGA 


GCAAAACAAA 


AGTAAGAAAA 


AG G CACAGC A 


AGCAGC AG CT 


360 


GCAGCTGGCA 


CAGGAAACAG 


CAGCCAGGTC 


AG CCAAAAT T 


AC C CT AT AG T 


GCAGAACCTA 


420 




CAGGG GCAAA 


TGGTACATCA 


G G C CAT AT CA 


CCTAGAACTT 


TAAATGCATG 


GGTAAAAGTA 


480 


IU 


GTAGAAGAAA 


AGGCTTTCAG 


CCCAGAAGTA 


AT AC CCATGT 


TTTCAGCATT 


ATCAGAAGGA 


540 




GCCACCCCAC 


AAGATT TAAA 


CAC CAT GCTA 


AACACAGTGG 


GGGGACATCA 


AGCAGC CATG 


600 




CAAAT GTTAA 


AAGAGACTAT 


CAATGAGGAA 


GCTGCAGAAT 


GGGATAGAGT 


GCAT CCAGTG 


660 




CATGCAGGGC 


CTATTGCACC 


AGGCCAAATG 


AGAGAACCAA. 


GGGGAAGTGA 


CAT AGCAG GA 


720 


30 


ACTACTAGTA 


CCCTTCAGGA 


ACAAATAGGA 


TGGATGACAA 


ATAATCCACC 


TATCCCAGTA 


780 




GGAGAAATCT 


AT AAAAG AT G 


GATAATCCTG 


G GATTAAAT A 


AAATAGTAAG 


AATGTATAGC 


840 




CCTACCAGCA 


TTCTGGACAT 


AAGACAAGGA 


CCAAAGGAAC 


CCTTTAGAGA 


TTATGTAGAC 


900 




CGGTTCTATA 


AAACTCTAAG 


AGCCGAACAA 


GCTTCACAGG 


ATGTAAAAAA 


TTGGATGACA 


960 




GAAACCTTGT 


TGGTCCAAAA 


TGCAAACCCA 


GATT GTAAGA 


CTATTTTAAA 


AGCATT GGGA 


1020 


35 


CCAGCAGCTA 


CACTAGAAGA 


AATGATGACA 


GCATGTCAGG 


GAGTGGGGGG 


ACCCGGCCAT 


1080 




AAAGCAAGAG 


TTTTGGCTGA 


AGC CAT GAGC 


CAAGTAACAA 


AT C CAGCTAA 


CAT AAT GAT G 


1140 




CAGAGAGGCA 


ATTTTAGGAA 


CCAAAGAAAG 


ACTGTTAAGT 


GTTTCAATTG 


T GG CAAAGAA 


1200 




GGGCACATAG 


CCAAAAATTG 


CAGGGCCCCT 


AGGAAAAAGG 


GCTGTTGGAG 


ATGT GGAAGG 


1260 




GAAGGACACC 


AAATGAAAGA 


TTGCACTGAG 


AGACAGGCTA 


ATTTTTTAGG 


GAAGATCTGG 


1320 


40 


CCTTCCTACA 


AGGGAAGGCC 


AGGGAATTTT 


CTTCAGAGCA 


GACCAGAGCC 


AACAGCCCCA 


1380 




CCAGAAGAGA 


GCTTCAGGTT 


T GGGGAG GAG 


AAAACAACTC 


CCTCTCAGAA 


GCAGGAGCCG 


1440 



51 



PATENT 

ATAGACAAGG AACTGTATCC TTTAACTTCC 
TCACAATAAG GATAGGGGGG CAACTAAAGG 
CAGTATTAGA AGAAAT GAAT TTGCCAGGAA 



LING.001.01US 

CTCAGATCAC TCTTTGGCAA CGACCCCTCG 1500 
AAGCTCTATT AGATAGAGGA GCAGATGATA 1560 
AATGGAAACC AAAAATGATA 1610 



<210> INFORMATION FOR SEQ ID NO : 2 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: PRT 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: peptide fragment of host cell (wheat germ) protein 

HP68 



<400> SEQUENCE DESCRIPTION: SEQ ID NO : 2 



Pro Arg Pro Tyr Leu Asp Val Lys Gin Arg Leu Lys Ala Ala Arg Val 

15 10 15 

lie Arg Ser Leu Leu Arg Ser Asn 



<210> INFORMATION FOR SEQ ID NO : 3 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: Degenerate oligonucleotide C-terminal peptide 
sequence of WGHP68 



<400> SEQUENCE DESCRIPTION: SEQ ID NO : 3 



AT GAAT T C AC TGGGACTGCG GATAGATTAC TGGTACTGGG GATC 4 4 



<210> INFORMATION FOR SEQ ID NO: 4 
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PATENT 



LING.001.01US 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: Degenerate oligonucleotide C-terminal peptide 
sequence of WGHP68 

<4 00> SEQUENCE DESCRIPTION: SEQ ID NO : 4 

ATGAATTCAC TGGGCTCTGA TAGATTACTG GTACTGGGGA TC 42 



<210> SEQ ID NO:5 
<211> Length: 604 
<212> Type: PRT 

<213> Organism: Triricum aestivum 
<400> Sequence 5 

Met Ala Asp Arg Leu Thr Arg lie Ala lie Val Ser Glu Asp Lys Cys 

15 10 15 

Lys Pro Lys Lys Cys Arg Gin Glu Cys Lys Lys Ser Cys Pro Val Val 

20 25 30 

Lys Thr Gly Lys Leu Cys lie Glu Val Ser Pro Val Ala Lys Leu Ala 

35 40 45 

Phe lie Ser Glu Glu Leu Cys lie Gly Cys Gly lie Cys Val Lys Lys 

50 55 60 

Cys Pro Phe Asp Ala lie Glu lie lie Asn Leu Pro Lys Asp Leu Glu 
65 70 75 80 

Lys Asp Thr Thr His Arg Tyr Gly Pro Asn Thr Phe Lys Leu His Arg 

85 90 95 

Leu Pro Val Pro Arg Pro Gly Gin Val Leu Gly Leu Val Gly Thr Asn 

100 105 110 

Gly lie Gly Lys Ser Thr Ala Leu Lys Val Leu Ala Gly Lys Leu Lys 

115 120 125 

Pro Asn Leu Gly Arg Phe Lys Asn Pro Pro Asp Trp Gin Glu lie Leu 
130 135 140 
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PATENT LING.00 1 .0 1 US 

Thr Tyr Phe Arg Gly Ser Glu Leu Gin Asn Tyr Phe Thr Arg lie Leu 
145 150 155 160 

Glu Asp Asn Leu Lys Ala lie lie Lys Pro Gin Tyr Val Asp His lie 
165 170 175 

5 Pro Lys Ala Val Gin Gly Asn Val Gly Gin Val Leu Glu Gin Lys Asp 
180 185 190 

Glu Arg Asp Met Lys Asn Glu Leu Cys Val Asp Leu Glu Leu Asn Gin 

195 200 205 

Val lie Asp Arg Asn Val Gly Asp Leu Ser Gly Gly Glu Leu Gin Arg 
10 210 215 220 

Phe Ala lie Ala Val Val Ala Val Gin Ser Ala Glu lie Tyr Met Phe 
225 230 235 240 

Asp Glu Pro Ser Ser Tyr Leu Asp Val Lys Gin Arg Leu Lys Ala Ala 
245 250 255 

15 Arg Val lie Arg Ser Leu Leu Arg Ser Asn Ser Tyr Val lie Val Val 
260 265 270 

Glu His Asp Leu Ser Val Leu Asp Tyr Leu Ser Asp Phe lie Cys Cys 
»p 275 280 285 

5r; Leu Tyr Gly Lys Pro Gly Ala Tyr Gly Val Val Thr Leu Pro Phe Ser 
fJO 290 295 300 

W Val Arg Glu Gly lie Asn He Phe Leu Ala Gly Phe Val Pro Thr Glu 
]~ 305 310 315 320 

|=5. Asn Leu Arg Phe Arg Asp Glu Ser Leu Thr Phe Lys He Ala Glu Thr 
□ 325 330 335 

125 Gin Glu Ser Ala Glu Glu Val Ala Thr Tyr Gin Arg Tyr Lys Tyr Pro 

340 345 350 

Thr Met Ser Lys Thr Gin Gly Asn Phe Lys Leu Ser Val Val Glu Gly 

355 360 365 

Glu Phe Thr Asp Ser Gin He Val Val Met Leu Gly Glu Asn Gly Thr 
30 370 375 380 

Gly Lys Thr Thr Phe He Arg Met Leu Ala Gly Leu Leu Lys Pro Asp 
385 390 395 400 

Thr Met Glu Gly Thr Glu Val Glu He Pro Glu Phe Asn Val Ser Tyr 
405 410 415 

35 Lys Pro Gin Lys He Ser Pro Lys Phe Gin His Pro Val Arg His Leu 
420 425 430 

Leu His Ser Lys He Arg Asp Ser Tyr Thr His Pro Gin Phe Val Ser 

435 440 445 

Asp Val Met Lys Pro Leu Gin He Glu Gin Leu Met Asp Gin Glu Val 
40 450 455 460 

He Asn Leu Ser Gly Gly Glu Leu Gin Arg Val Ala Leu Cys Leu Cys 
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465 470 475 480 

Leu Gly Lys Pro Ala Asp lie Tyr Leu lie Asp Glu Pro Ser Ala Tyr 

485 490 495 

Leu Asp Ser Glu Gin Arg lie Val Ala Ser Lys Val lie Lys Arg Phe 
5 500 505 510 

lie Leu His Ala Lys Lys Thr Ala Phe lie Val Glu His Asp Phe lie 

515 520 525 

Met Ala Thr Tyr Leu Ala Asp Lys Val lie Val Tyr Glu Gly Leu Ala 
530 535 540 

10 Ser lie Asp Cys Thr Ala Asn Ala Pro Gin Ser Leu Val Ser Gly Met 
545 550 555 560 

Asn Lys Phe Leu Ser His Leu Asp lie Thr Phe Arg Arg Asp Pro Thr 

565 570 575 

Asn Tyr Arg Pro Arg lie Asn Lys Leu Glu Ser Thr Lys Asp Arg Glu 
,15 580 585 590 

|3 Gin Lys Asn Ala Gly Ser Tyr Tyr Tyr Leu Asp Asp 
l * 595 600 

a ~ 
'£ __; 

IU <210> SEQ ID NO: 6 

<211> LENGTH: 599 



<212> TYPE: PRT 

<213> ORGANISM: Homo sapians 

<4 00> SEQUENCE 6 



lQ.5 Met Ala Asp Lys Leu Thr Arg lie Ala lie Val Asn His Asp Lys Cys 

si \ 

1 5 10 .15 

Lys Pro Lys Lys Cys Arg Gin Glu Cys Lys Lys Ser Cys Pro Val Val 

20 25 30 

Arg Met Gly Lys Leu Cys lie Glu Val Thr Pro Gin Ser Lys lie Ala 
30 35 40 45 

Trp lie Ser Glu Thr Leu Cys lie Gly Cys Gly lie Cys lie Lys Lys 

50 55 60 

Cys Pro Phe Gly Ala Leu Ser lie Val Asn Leu Pro Ser Asn Leu Glu 
65 70 75 80 

35 Lys Glu Thr Thr His Arg Tyr Cys Ala Asn Ala Phe Lys Leu His Arg 

85 90 95 

Leu Pro lie Pro Arg Pro Gly Glu Val Leu Gly Leu Val Gly Thr Asn 

100 105 110 

Gly lie Gly Lys Ser Ala Ala Leu Lys lie Leu Ala Gly Lys Gin Lys 
40 115 120 125 

Pro Asn Leu Gly Lys Tyr Asp Asp Pro Pro Asp Trp Gin Glu lie Leu 
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PATENT LING. 00 1 .0 1 US 

130 135 140 

Thr Tyr Phe Arg Gly Ser Glu Leu Gin Asn Tyr Phe Thr Lys lie Leu 
145 150 155 160 

Glu Asp Asp Leu Lys Ala lie lie Lys Pro Gin Tyr Val Ala Arg Phe 
5 165 170 175 

Leu Arg Leu Ala Lys Gly Thr Val Gly Ser lie Leu Asp Arg Lys Asp 

180 185 190 

Glu Thr Lys Thr Gin Ala lie Val Cys Gin Gin Leu Asp Leu Thr His 
195 200 205 

10 Leu Lys Glu Arg Asn Val Glu Asp Leu Ser Gly Gly Glu Leu Gin Arg 
210 215 220 

Phe Ala Cys Ala Val Val Cys lie Gin Lys Ala Asp He Phe Met Phe 
225 230 235 240 

Asp Glu Pro Ser Ser Tyr Leu Asp Val Lys Gin Arg Leu Lys Ala Ala 
15 245 250 255 

He Thr He Arg Ser Leu He Asn Pro Asp Arg Tyr He He Val Val 
Q 260 265 270 

s 'p Glu His Asp Leu Ser Val Leu Asp Tyr Leu Ser Asp Phe He Cys Cys 

{y 275 280 285 

l3>0 Leu Tyr Gly Val Pro Ser Ala Tyr Gly Val Val Thr Met Pro Phe Ser 
290 295 300 

Val Arg Glu Gly He Asn He Phe Leu Asp Gly Tyr Val Pro Thr Glu 
H 305 310 315 320 

.: — z 
s : 

If. Asn Leu Arg Phe Arg Asp Ala Ser Leu Val Phe Lys Val Ala Glu Thr 
H|5 325 330 335 

Ala Asn Glu Glu Glu Val Lys Lys Met Cys Met Tyr Lys Tyr Pro Gly 

340 345 350 

Met Lys Lys Lys Met Gly Glu Phe Glu Leu Ala He Val Ala Gly Glu 
355 360 365 

30 Phe Thr Asp Ser Glu He Met Val Met Leu Gly Glu Asn Gly Thr Gly 
370 375 380 

Lys Thr Thr Phe He Arg Met Leu Ala Gly Arg Leu Lys Pro Asp Glu 
385 390 395 400 

Gly Gly Glu Val Pro Val Leu Asn Val Ser Tyr Lys Pro Gin Lys He 
35 405 410 415 

Ser Pro Lys Ser Thr Gly Ser Val Arg Gin Leu Leu His Glu Lys He 

420 425 430 

Arg Asp Ala Tyr Thr His Pro Gin Phe Val Thr Asp Val Met Lys Pro 
435 440 445 

40 Leu Gin He Glu Asn He He Asp Gin Glu Val Gin Thr Leu Ser Gly 
450 455 460 
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Gly Glu Leu Gin Arg Val Arg Leu Arg Leu Cys Leu Gly Lys Pro Ala 
465 470 475 480 

Asp Val Tyr Leu lie Asp Glu Pro Ser Ala Tyr Leu Asp Ser Glu Gin 
485 490 495 

5 Arg Leu Met Ala Ala Arg Val Val Lys Arg Phe lie Leu His Ala Lys 
500 505 510 

Lys Thr Ala Phe Val Val Glu His Asp Phe lie Met Ala Thr Tyr Leu 

515 520 525 

Ala Asp Arg Val lie Val Phe Asp Gly Val Pro Ser Lys Asn Thr Val 
10 530 535 540 

Ala Asn Ser Pro Gin Thr Leu Leu Ala Gly Met Asn Lys Phe Leu Ser 
545 550 555 .560 

Gin Leu Glu lie Thr Phe Arg Arg Asp Pro Asn Asn Tyr Arg Pro Arg 
565 570 575 

15 He Asn Lys Leu Asn Ser He Lys Asp Val Glu Gin Lys Lys Ser Gly 
580 585 590 

Asn Tyr Phe Phe Leu Asp Asp 



595 
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WHAT IS CLAIMED IS: 
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